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Abstract 

The present paper is part of a broader programme, exploring the possibility of 
involving the Microsoft Kinect"’"'^ sensor in the analysis of human motion. In this 
study, the output obtained from the two available versions of this sensor is critically 
examined. We demonstrate that the two outputs differ in regard to the variation 
of the physical quantities involved in the modelling of the human motion. As the 
original sensor has been found unsuitable for applications requiring high precision, 
the observed differences in the output of the two sensors call for the validation of 
the upgraded sensor on the basis of a marker-based system. 
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1 Introduction 


Microsoft Kinect"'"'^ (hereafter, simply ‘Kinect’) [1], a low-cost, portable motion¬ 
sensing hardware device, was developed by the Microsoft Corporation (Mi¬ 
crosoft, USA) as an accessory to the Xbox 360 video-game console (2010). 
The sensor is a webcamera-type, add-on peripheral device, enabling the oper¬ 
ation of Xbox via gestures and spoken commands. In 2011, Microsoft released 
the software-development kit (SDK) for Kinect, thus enabling the development 
of applications in several standard programming languages. The hrst upgrade 
of the sensor (‘Kinect for Windows v2’), both hardware- and software-wise, 
tailored to the needs of Xbox One, became available (in July 2014) for general 




development and use. The present paper is part of our research programme, 
aiming at investigating the possibility of involving either of the Kinect sensors 
in the analysis of motion data of subjects walking or running ‘in place’ (e.g., 
on commercially-available treadmills). If successful, Kinect could become an 
interesting alternative to marker-based systems (MBSs) in capturing data for 
motion analysis, one with an incontestably high bene£t-to-cost ratio. 

It is rather surprising that only one study, addressing the possibility of involv¬ 
ing Kinect in the analysis of walking and running motion (i.e., not in a static 
mode or in slow motion), has appeared so far [2]. Using similar methodology 
to the one followed herein, the authors in that study came to the conclusion 
that the original sensor is unsuitable for applications requiring high preci¬ 
sion. Of course, it remains to be seen whether any improvement in the overall 
quality of the output can be obtained with the upgraded sensor. The present 
study investigates the similarity of the output of the two sensors; in case of 
significant differences, the validation of the upgraded sensor, on the basis of a 
marker-based system (MBS), would be called for. 

The detailed description of the methodology we follow herein may be found in 
Ref. [3]; we will refer to that paper whenever necessary. The present paper has 
been organised in five sections. In Section 2, we provide a short description of 
the hardware used in our study and give a summary of our methodology in 
the analysis of the human-motion data. In Section 3, we give details on the 
experimental part of the study. The results, obtained from the analysis of the 
data of Section 3, are presented in Section 4. We discuss the main conclusions 
of the paper in the last section. 


2 Data acquisition and analysis 


2.1 The Kinect sensors 


In the original sensor, the skeletal data (‘stick figure’) of the output comprises 
20 time series of three-dimensional (3D) vectors of spatial coordinates, i.e., 
measurements of the {x,y,z) coordinates of the 20 nodes which the sensor 
associates with the axial and appendicular parts of the human skeleton. In 
coronal (frontal) view of the subject (sensor view), the Kinect coordinate 
system is defined with the x axis (medial-lateral) pointing to the left (i.e., to 
the right part of the body of the subject being viewed), the y axis (vertical) 
upwards, and the 2 ; axis (anterior-posterior) away from the sensor. 

The nodes 1 to 4 are main-body nodes, identified as IIIP_CENTER, SPINE, 
SHOULDER_CENTER, and HEAD. The nodes 5 to 8 relate to the left arm: 
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SHOULDERXEFT, ELBOWXEFT, WRIST_LEFT, and HANDXEFT; sim¬ 
ilarly, the nodes 9 to 12 on the right arm are: SHOULDER_RIGHT, EL- 
BOW_RIGHT, WRIST_RIGHT, and HAND_RIGHT. The eight remaining 
nodes pertain to the legs, the hrst four to the left (HIP_LEFT, KNEE_LEFT, 
ANKLE_LEFT, and FOOT_LEFT), the remaining four to the subject’s ri^t 
(HIP_RIGHT, KNEE_RIGHT, ANKLE_RIGHT, and FOOT_RIGHT) leg0. 

In the upgraded sensor, some modihcations have been made in the nam¬ 
ing (and placement) of some nodes. The original node HIP_GENTER has 
been replaced by SPINE_BASE (and appears slightly shifted downwards); 
the original node SPINE has been replaced by SPINE_MID (and appears 
slightly shifted upwards); hnally, the original node SHOULDER_GENTER 
has been replaced by NEGK (and also appears slightly shifted upwards). 
Five new nodes have been appended at the end of the list, one of which 
is a body node (SPINE_SIIOULDER, node 21), whereas four nodes per¬ 
tain to the subject’s hands, HAND_TIP_LEFT (22), THUMB_LEFT (23), 
HAND_TIP_RIGHT (24), and THUMB_RIGHT (25). 

In both versions, parallel to the captured video image, Kinect acquires an 
infrared image; captured with a GGD camera, this infrared image enables the 
extraction of information on the depth 2 ;. The sampling rate in the Kinect 
output (for the video and the skeletal data, for both versions of the sensor) is 
30 Hz. 


2.2 Stray motion 


The subject’s motion on the treadmill is split into two components: the motion 
of the subject’s ‘centre of mass’ (GM), which should be considered as one 
reference point, moving synchronously with the subject’s physical GM, and 
the motion of the subject’s body parts relative to the GM. The coordinates of 
the GM are extracted from the main-body, the shoulder, and the hip nodes. 
Prior to further processing, the GM offsets (xcm,2/c'mGcm) are removed from 
the data at all times; thus, the motion is dehned relative to the subject’s GM. 
The largeness of the subject’s ‘stray’ motion on the treadmill is assessed on 
the basis of the root-mean-square (rms) of the distributions oi xcmi Vcm, and 
ZCM- 


^ The subject’s left and right parts refer to what the subject perceives as the left 
and right parts of his/her body. 
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2.3 Period of the gait cycle 


Ideally, the period of the gait cycle T is dehned as the time lapse between suc¬ 
cessive time instants corresponding to identical postures of the human body 
(position and direction of motion of the human-body parts with respect to 
the CM). (Of course, the application of ‘identicalness’ in living organisms is 
illusional; no two postures can ever be expected to be identical in the for¬ 
mal sense.) We dehne the period of the gait cycle as the time lapse between 
successive most distal positions of the same lower leg, which is identihed 
herein as the ankle; one could also use the midpoints of the ankle and foot 
nodes, yet our past experience with the original sensor suggests caution with 
the foot nodes due to frequent artefacts, in particular in dorsal positions. The 
arrays of the time instants, at which the left or right lower leg is at its most 
distal position with respect to the instantaneous CM of the subject, are used 
in timing the motion of the left or right part of the human body. 

The period of the gait cycle is related to two other quantities which are used 
in the analysis of motion data. 

• The stride length L is the product of the velocity v and the period of the 
gait cycle: L = vT. 

• The cadence C is dehned as the number of steps per unit of time; a commonly- 
used unit is the number of steps per min. 


2.4 Waveforms 


Using the time-instant arrays from the analysis of the left and right lower-leg 
signals (as described in Subsection 2.3), each time series (involving one specihc 
node and spatial direction) was split into one-period segments, which were 
subsequently superimposed and averaged, to yield a representative movement 
for the node and spatial direction over the gait cycle. Average waveforms for all 
nodes and spatial directions, representing the variation of the motion of that 
node (in 3D) within the gait cycle, were extracted separately for the left and 
right nodes of the extremities; waveforms were also extracted for the important 
angles introduced in Subsection 3.1 of Ref. [3]. In case that left/right (L/R) 
information is not available (as, for example, in the case of the trunk angle), 
the right lower leg was used in the timing. All waveforms were subsequently 
0-centred; the removal of the average offsets is necessary, given that the two 
systems yield output which cannot be thought of as corresponding to the same 
anatomical locations. 

The left and right waveforms yield two new waveforms, identihed as the ‘L/R 
average’ (LRA) and the ‘right-minus-left difference’ (RLD); the L/R differ- 
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ences in the output must be investigated as we intend to use Kinect in order 
to extract the asymmetrical features in the motion. 


2.5 Scoring options when comparing waveforms 


The similarity of corresponding waveforms is judged on the basis of hve scoring 
options: Pearson’s correlation coefficient, the Zilliacus error metric, the RMS 
error metric. Whang’s score, and Theil’s score. Assuming that a (0-centred) 
waveform, obtained from the original sensor, is denoted by ki and the corre¬ 
sponding (0-centred) waveform from the upgraded sensor by g^, the hve scoring 
options are dehned in Eqs. (8)-(12) of Ref. [3] (where alternative ways for test¬ 
ing the similarity of the output of different systems are also discussed). All 
waveforms are obtained herein using 50 histogram bins in the data processing. 

At hxed velocity of the treadmill belt, we will investigate the variation of 
these scores for the nodes and spatial directions of the extremities. Tests will 
be performed on the LRA waveforms, as well as on those corresponding to 
the RLD waveforms. After studying the goodness of the association of the 
waveforms at hxed velocity, we will investigate velocity-dependent ehects. 


3 Data acquisition 


The data acquisition involved one male adult (ZHAW employee), with no 
known motion problems, walking and running on a commercially-available 
treadmill (Horizon Laufband Adventure 5 Plus, Johnson Health Tech. GmbH, 
Germany). The placement of the treadmill in the laboratory of the Institute of 
Mechatronic Systems (School of Engineering, ZHAW), where the experimen¬ 
tation took place, was such that the subject’s motion be neither hindered nor 
inhuenced in any way by close-by objects. Prior to the data-acquisition ses¬ 
sions, the Kinect sensors were properly centred and aligned. The sensors were 
then left in the same position, untouched throughout the data acquisition. 

The original sensor also provides information on the elevation (pitch) angle at 
which it is set. During our extensive tests, we discovered that this information 
is not reliable, at least for the particular device we used in our experimen¬ 
tation. To enable the accurate determination of the elevation angle of the 
sensor, we set forth a simple procedure. The subject stands (in the upright 
position, not moving) at a number of positions on the treadmill belt, and static 
measurements (e.g., 5 s of Kinect data) at these positions are obtained and 
averaged. The elevation angle of the sensor may be easily obtained from the 
slope of the average (over a number of Kinect nodes, e.g., of those pertain- 
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ing to the hips, knees, and ankles) {ViZ) coordinates corresponding to these 
positions. The output data, obtained from the original sensor, were corrected 
(off-line) accordingly, to yield the appropriate spatial coordinates of the Kinect 
nodes in the ‘untilted’ coordinate system. To prevent the original sensor from 
re-adjusting the elevation angle during the data acquisition (which is a prob¬ 
lematic feature), we attached its body unto a plastic structure mounted on 
a tripod. Mounted on the same structure, about 10 cm above the original 
sensor, was the upgraded sensor. A dedicated correction for the tilt effects in 
the output of the upgraded sensor (which does not provide information on the 
elevation angle) was performed as aforementioned. The data from the origi¬ 
nal sensor were corrected by (a rotation around the x axis by) 6.1°, whereas 
those from the upgraded sensor by 7.2°. These corrections were conhrmed 
by measurements taken with a laser distance meter. Interestingly, the value 
of the elevation angle, obtained from the SDK of the original sensor in this 
conhguration, was only 5°. 

It is worth mentioning that, as we are interested in capturing the motion of 
the subject’s lower legs (i.e., of the ankle and foot nodes), the Kinect sensors 
must be placed at such a height that the number of lost lower-leg signals be 
kept reasonably small. Our past experience dictated that the Kinect sensor 
be placed close to the minimal height recommended by the manufacturer, 
namely around 2 ft off the (treadmill-belt) floor. Placing the sensor higher 
(e.g., around the midpoint of the recommended interval, namely at 4 ft off the 
treadmill-belt floor) leads to many lost lower-leg signals (the ankle and foot 
nodes are not tracked), as the lower leg is not visible by the sensor during a 
sizeable fraction of the gait cycle, shortly after toe-off (TO). 

The Kinect sensor may lose track of the lower parts of the subject’s extremities 
(wrists, hands, ankles, and feet) for two reasons: either due to the particularity 
of the motion of the extremity in relation to the position of the sensor (e.g., 
the identihcation of the elbows, wrists, and hands becomes problematic in 
some postures, where the viewing angle of the ulnar bone by Kinect is small) 
or due to the fact that these parts of the human body are obstructed (behind 
the subject) for a fraction of the gait cycle. Assuming that these instances 
remain rare (e.g., below about 3% of the available data in each time series, i.e., 
one frame in 30), the missing values may be reliably obtained (interpolated) 
from the well-determined (tracked) data. Although, when normalised to the 
total number of the available values, the untracked signals usually appear 
‘harmless’, particular attention was paid in order to ensure that no node be 
signihcantly affected, as in such a case the interpolation could have yielded 
unreliable results. 

Five velocities were used in the data acquisition: walking-motion data were 
acquired at 5 km/h; running-motion data at 8, 9, 10, and 11 km/h. At each 
velocity setting, the subject was given 1 min to adjust his movements comfort- 
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ably to the velocity of the treadmill belt. The Kinect output spanned slightly 
more than 2 min at each velocity. The variation of the distance between the 
subject and the Kinect sensors was monitored during the data acquisition; it 
ranged between about 2.5 and 2.9 m, i.e., well within the limits for the use of 
the sensors set by the manufacturer. The recording on the two measurement 
systems started simultaneously. 

Regarding the subject’s ‘stray’ motion, the rms average values (over all veloc¬ 
ities) of the xcM) ycM, and zcm distributions from the original sensor were: 
21.4, 16.9, and 33.6 mm; the corresponding values from the upgraded sensor 
were: 20.6, 19.1, and 32.1 mm. Averages of the period of the gait cycle T, of 
the cadence (7, and of the stride length L are given in Table 1, separately for 
the two measurement systems at all velocities; the agreement between corre¬ 
sponding values is very satisfactory. 


4 Comparison of the results obtained from the two Kinect sensors 


We commence with the LRA waveforms. The goodness of the association of the 
waveforms obtained with the two measurement systems for the eight node lev¬ 
els of the extremities (SHOULDER, ELBOW, WRIST, HAND, HIP, KNEE, 
ANKLE, and FOOT) and spatial directions was investigated as follows. Sep¬ 
arately for each of the hve scoring options of Section 2.5, velocity, and spatial 
direction, each node level was ranked according to the goodness of the asso¬ 
ciation between the waveforms obtained from the two measurement systems. 
The node level with the worst association was given the mark of 0, whereas 
the one with the best association was assigned the mark of 7. The sum of the 
ranking scores over all velocities and scoring options yielded an 8 x 3 ‘matrix 
of goodness of the association’ (8 node levels of the extremities, 3 spatial di¬ 
rections); the minimal entry in this matrix may be as small as 0 (which, in 
fact, was the overall score of the hips in the 2 ; direction!), whereas the maximal 
value may be as large as7x5x5 = 175 (the maximal value was obtained for 
the foot nodes in the 2 ; direction, a score of 173). It was found that the nodes 
with the worst association (henceforth, NWA) between the two measurement 
systems correspond to the shoulders (in all three spatial directions) and the 
hips (in the x and 2 ; directions); the knees in the y direction complete the first 
quartile of the similarity-index distribution. The average value of Pearson’s 
correlation coefficient for the NWA was found equal to 0.464 and the rms of 
the distribution to 0.387. 

We subsequently pursued the investigation of systematic differences in the 
performance of the sensors regarding: a) the upper and lower parts of the hu¬ 
man body (i.e., upper- versus lower-extremity nodes) and b) the three spatial 
directions. 
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• To assess the similarity of the waveforms, obtained from the two measure¬ 
ment systems for the nodes of the upper and lower extremities, one-factor 
ANOVA tests were performed, separately for each of the hve scoring options 
of Subsection 2.5, on the scores extracted at each velocity, for all upper- 
extremity nodes and spatial directions, and all lower-extremity nodes and 
spatial directions. Assuming a signihcance level pmm = 1% (the value which 
most statisticians adopt as the outset of the statistical signihcance), none 
of these tests resulted in signihcant effects, at any velocity; the minimal 
p-value from these tests was about 0.250. The exclusion of the scores of the 
NWA does not affect the results signihcantly, except at 11 km/h where the 
resulting p-value dropped to 3.39 • 10“^ (which, nonetheless, exceeds the 
adopted pmm level). We thus conclude that the analysis cannot support any 
signihcant diherences in the performance of the two sensors for the upper 
and lower extremities. 

• We next addressed the goodness of the association of the waveforms with 
regard to the three spatial directions x, y, and 0 . Although the correspond¬ 
ing ANOVA tests did not reveal signihcant ehects (except at 8 km/h, where 
the p-value was 5.19 • 10“^), the p-values were generally small, i.e., close 
to our pmin threshold. After the exclusion of the scores of the NWA, all 
running-motion data yielded p-values below Pmm- The best-matching wave¬ 
forms correspond to the depth the average values of Pearson’s correlation 
coefficient in the x, y, and z directions were found to be: 0.828, 0.834, and 
0.973, respectively. 

It must be mentioned that pronounced velocity-dependent ehects were ob¬ 
served in the overall similarity of the output of the two sensors. The asso¬ 
ciation of the waveforms, obtained from the two Kinect sensors, appears to 
deteriorate with decreasing velocity; for instance, the average value (over all 
the nodes and spatial directions) of Pearson’s correlation coefficient dropped 
(almost linearly) from 0.879 (at 11 km/h) to 0.609 (at 5 km/h). The exclusion 
of the scores of the NWA increases the overall similarity, leaving its velocity 
dependence almost intact; for instance, the average value of Pearson’s correla¬ 
tion coefficient dropped (almost linearly) from 0.947 (at 11 km/h) to 0.788 (at 
5 km/h). Similar conclusions were drawn after examining the results obtained 
with the other four scoring options. We have not found a plausible explana¬ 
tion of such a velocity dependence in the similarity of the output of the two 
sensors. 

We also compared the y waveforms of the lower legs. This comparison is in¬ 
teresting for two reasons. First, the lower-leg signals are used in timing the 
motion; second, we intend to use these signals in order to obtain the times 
(expressed as fractions of the gait cycle) of the initial contact (IC) and the 
TO [4,5]; the difference of these two values is the stance fraction. Shown in 
Figs. 1 and 2 are the waveforms obtained from the outputs of the two mea¬ 
surement systems, separately at each velocity. The waveforms represent the 



variation of the raw signals, i.e., the y offsets of the subject’s CM have not 
been removed. The salient feature in the waveforms obtained with the original 
sensor is a pronounced peak appearing around the IC; this peak is reduced in 
the waveforms obtained with the upgraded sensor. 

We also investigated the goodness of the association of the RLD waveforms. 
To this end, two-sided t-tests were performed on the score distributions be¬ 
tween corresponding LRA and RLD waveforms, a total of 75 tests: hve scoring 
options, three tests per scoring option (paired, homoscedastic, and unequal- 
variance), and hve velocities. The p-values, obtained from the majority of 
these tests for the running-motion data, were found to be small, below Pmin] 
the median p-value was equal to 2.36 • 10“^, whereas the minimal one was 
2.53 • 10“®. (After the exclusion of the scores of the NWA, the median p-value 
dropped to 1.10 • 10“^, whereas the minimal one to 5.69 • 10“®.) The anal¬ 
ysis showed that Pearson’s correlation coefficients of the tests on the RLD 
waveforms were systematically below those of the LRA waveforms, whereas 
all other scores were larger for the RLD waveforms, thus indicating a poorer 
association of the waveforms obtained from the two measurement systems for 
the RLDs compared to the LRAs. As a result, the similarity between the RLD 
waveforms, obtained with the two Kinect sensors, is weaker than between the 
LRA waveforms. 

We also examined the similarity of the waveforms for the important angles, 
introduced in Subsection 3.1 of [3]. The only waveforms which match well are 
those for the hip and knee angles in the sagittal plane, and (to a lesser degree) 
those for the hip angles in the coronal plane. In the sagittal plane, the average 
values of Pearson’s correlation coefficients for the LRAs were equal to 0.972 
and 0.920 for the hip and knee angles, respectively. In the coronal plane, the 
average value of Pearson’s correlation coefficient for the LRAs was equal to 
0.873 for the hip angles. The corresponding RLD waveforms do not much well; 
in the sagittal plane, the average values of Pearson’s correlation coefficients 
were equal to 0.253 and 0.457; the agreement for the hip angles in the coronal 
plane was found to be even poorer (Pearson’s correlation coefficient came out 
around 0.293). The average value of Pearson’s correlation coefficients for the 
LRAs in the case of the knee angle in 3D was 0.918; for the corresponding 
RLDs, 0.314; the left and right knee angles in 3D are shown in Figs. 3 and 4. 

We hnally address the comparison of the RoMs obtained from the waveforms 
of two measurement systems waveforms. It might be argued that one could 
simply use in a study the RoMs, rather than the waveforms, as representative 
of the motion of each node. Of course, given that each waveform is essentially 
replaced by one number, the information content in the RoMs is drastically 
reduced compared to that contained in the waveforms. Shown in Fig. 5 is a 
scatter plot of the RoMs of the LRA waveforms. The ideal relation between 
these two quantities is a straight line with slope equal to 1; Pearson’s correla- 


9 



tion coefficient between the two RoM arrays came out equal to 0.995. Shown in 
Fig. 6 is a scatter plot of the RoMs of the RLD waveforms. A straight line with 
slope equal to 1 is the expected relation here too; however, the scattering of the 
values is larger and Pearson’s correlation coefficient dropped to 0.917. Again, 
we draw the conclusion that the similarity of the RLD waveforms, obtained 
with the two Kinect sensors, is smaller to that of the LRA waveforms. An 
unweighted quadratic £t, constrained to pass through the origin of the plot, 
to the RoMs of the LRA waveforms resulted in curvature and slope values 
equal to (—25.7 ±2.6) • 10“^ mm“^ and 1.259 ±0.016, respectively, indicating 
that (in comparison to the upgraded sensor) the original sensor overestimates 
(on average) the RoMs (in the domain of the values of Fig. 5) by about 12%. 
These results do not change significantly after the exclusion of the scores of 
the NWA. 


5 Discussion and conclusions 


The present study is part of a broader programme, aiming at investigating 
the involvement of the Microsoft Kinect"'"^ (hereafter, ‘Kinect’) sensor in the 
analysis of human motion. In this work, we set on assessing the similarity 
in the performance of the two (original and upgraded) versions of the sensor 
for data acquired from one subject walking and running on a commercially- 
available treadmill. An estimate of the degree of the association of the output 
of these systems is obtained after following the methodology of Ref. [3] and 
comparing the waveforms extracted for the nodes of the extremities from the 
two measurement systems, as well as of those for the important angles (defined 
in Subsection 3.1 therein). Our comparative study of the two Kinect sensors 
identifies the similarities and the differences in their performance, but (of 
course) cannot enable conclusions regarding which of the two sensors performs 
better. 

In this work, we came up with a number of significant differences in the per¬ 
formance of the two sensors. 

• The worst association is obtained for the waveforms of the shoulders (in all 
three spatial directions), of the hips (in the x and 2 ; directions), and of the 
knees (in the y direction). 

• The association of the waveforms in the two lateral directions {x and y) is 
poorer than that in the 2 : direction (depth). 

• The association of the waveforms deteriorates with decreasing velocity of 
the walker/runner. 

• The y waveforms of the lower legs come out different. A characteristic peak, 
which is present in the data of the original sensor shortly before the initial 
contact, appears reduced in the data of the upgraded sensor. 
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• Exempting the hip and knee angles in the sagittal plane, and the lateral 
hip angles in the coronal plane, the association of the waveforms of the 
important angles is poor. 

• The association of the ranges of motion (RoMs) is poor. It appears that the 
original sensor overestimates the RoMs (in the domain of the values of the 
present paper) by about 12%. 

• The association between the waveforms, entering the investigation of the 
asymmetry of the motion, is poorer than the results obtained for the average 
waveforms for the left and right parts of the human body. 

As the original sensor has been proven unsuitable for applications requiring 
high precision [2], it must be investigated whether the differences detailed 
herein constitute an improvement in the performance of the upgraded sensor. 
As a result, a dedicated validation study for the upgraded sensor, on the basis 
of a marker-based system, is called for. 
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Table 1 


The average values of the period of the gait cycle T, of the cadence C, and of the 
stride length L at the five velocity settings used in the data acquisition (see Section 
3), separately for the two Kinect sensors. 



5 km/h 

8 km/h 

9 km/h 

10 km/h 

11 km/h 

Original Kinect sensor 

r(s) 

1.0910(13) 

0.7391(17) 

0.7156(23) 

0.69544(78) 

0.6834(18) 

C (steps/min) 

109.99(13) 

162.36(37) 

167.69(54) 

172.55(19) 

175.60(46) 

L (m) 

1.5152(17) 

1.6425(37) 

1.7890(57) 

1.9318(22) 

2.0880(55) 

Upgraded Kinect sensor 

r(s) 

1.0896(23) 

0.7384(19) 

0.7160(19) 

0.6950(12) 

0.6826(17) 

C (steps/min) 

110.13(23) 

162.52(42) 

167.61(45) 

172.66(29) 

175.80(44) 

L (m) 

1.5134(32) 

1.6409(42) 

1.7899(48) 

1.9305(33) 

2.0858(52) 
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Kinect vl; left leg Kinect v2; left leg 




Fig. 1. Waveforms for the raw y coordinate of the left lower leg (ankle) obtained, 
separately for each velocity, from the two Kinect sensors. The quantity / is the 
fraction of the gait cycle. The difference in the values simply reflects the higher 
position of the upgraded sensor on the mount, see Section 3. 
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Kinect vl; right leg Kinect v2; right leg 
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Fig. 2. Same as Fig. 1 for the right lower leg (ankle). 
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Fig. 3. The waveforms for the left-knee angle in 3D obtained from the two Kinect 
sensors. The quantity / is the fraction of the gait cycle. 
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Fig. 4. Same as Fig. 3 for the right knee. 
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Fig. 5. The ranges of motion (RoMs) of the left/right average (LRA) waveforms, 
obtained with the original Kinect sensor, plotted versus those obtained with the 
upgraded Kinect sensor, at all velocity settings used in the data acquisition (see 
Section 3). 
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Fig. 6. Same as Fig. 5 for the ‘right-minus-left’ difference (RLD) waveforms. 
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